An Exploratory Study On The Appropriateness Of Latent Dirichlet Allocation For Automatic Discovery Of Product Associations From User-Generated Content
نویسندگان
چکیده
Latent Dirichlet Allocation (LDA) is a method that can be used to generate word association networks from unstructured text documents. However, no study has yet examined the applicability of LDA for deriving product associations from user-generated content. In this work, we apply LDA on 9,529 unstructured and uncategorized McDonald’s product reviews that were crawled from a German online review platform. We evaluate the applicability of LDA for deriving product associations from user-generated content. For this reason, we conducted a survey among 95 Information Systems undergraduate students about their associations with 17 McDonald’s-related nouns. Results indicate that LDA is a valid method for deriving product associations from user-generated content.
منابع مشابه
Sex, drugs, and violence
Automatically detecting inappropriate content can be a difficult NLP task, requiring understanding context and innuendo, not just identifying specific keywords. Due to the large quantity of online user-generated content, automatic detection is becoming increasingly necessary. We take a largely unsupervised approach using a large corpus of narratives from a community-based self-publishing websit...
متن کاملAutomatic keyword extraction using Latent Dirichlet Allocation topic modeling: Similarity with golden standard and users' evaluation
Purpose: This study investigates the automatic keyword extraction from the table of contents of Persian e-books in the field of science using LDA topic modeling, evaluating their similarity with golden standard, and users' viewpoints of the model keywords. Methodology: This is a mixed text-mining research in which LDA topic modeling is used to extract keywords from the table of contents of sci...
متن کاملUsing Latent Topic Features for Named Entity Extraction in Search Queries
Search is one of the most quickly growing applications in the mobile market. As people rely more on portable devices for performing search, it becomes increasingly important to analyze user queries in order to achieve more targetted results over a broad set of search entities. While most previous work has relied on lexico-syntactic features and handcrafted knowledge sources, this paper investig...
متن کاملAn Approach to Discovery and Re-ranking of Educational content from the World Wide Web using Latent Dirichlet Allocation
With tremendous increase in the amount of digital data available educators are forced to author content for learning and teaching for use in their classes. With that there has emerged a need to facilitate automatic discovery of learning resources from the World Wide Web. In this work, we present a novel approach for discovering content from the web for e-learning. We argue that for an e-learnin...
متن کاملExploring Latent Semantic Factors to Find Useful Product Reviews
Online reviews provided by consumers are a valuable asset for e-Commerce platforms, influencing potential consumers in making purchasing decisions. However, these reviews are of varying quality, with the useful ones buried deep within a heap of non-informative reviews. In this work, we attempt to automatically identify review quality in terms of its helpfulness to the end consumers. In contrast...
متن کامل